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1. Introduction 

We consider the following version of the Burkholder-Davis-Gundy inequality [8], 

[ 10 ]: 

Theorem 1.1. There is a constant C > 0 such that, for every bounded stopping 
time t, we have 

E[rd CE[B*(t)]. (1) 

Here (B(t)) t ^ o denotes a standard Brownian motion, starting at B( 0) = 0. 

’Supported in part by the Austrian Science Fund (FWF) under grant P25815 and P28661 
and by the Vienna Science and Technology Fund (WWTF) under grant MA14-008. 

TSupported in part by the Austrian Science Fund (FWF) under grants P26736 and Y782- 
N25. 


1 



W. Schachermayer and F.Stebegg/The Sharp Constant for the BDG inequality 2 
By B*(t) we denote the corresponding running maximum of the absolute value 

B*(t) := sup \B(u)\. 

O^u^t 

It is obvious that the set of constants C which satisfy inequality (1) is a closed, 
unbounded interval in R + . By the results of [5] it is known that C = | is 
contained in this set. To the best of our knowledge, this is the smallest constant 
known in the previous literature. In the present paper we establish the optimal 
value for this constant. 

Theorem 1.2. There is an ordinary integro-differential equation (see (31) be¬ 
low) depending on real parameters C > 0 and to > 0 such that C satisfies (1) if 
and only if there is to such that this equation has a well-defined solution. 

Numerical solutions of the equation (31) reveal that the smallest such C, 
i.e. the optimal constant in the Burkholder-Davis- Gundy inequality (1), equals 

C « 1,27267.... 

The paper is organised as follows. As usual in stochastic control theory, we 
first introduce the value function of the optimal stopping problem which cor¬ 
responds to the inequality (1). After some structural facts about the stopping 
problem we turn to some analytic properties of the value function in Section 3. 
We deduce the OIDE (ordinary integro-differential equation) which is referred 
to in Theorem 1.2. The subsequent section is devoted to properties of solutions 
to the fundamental OIDE (31) which are needed to identify these solutions with 
the value function of the stopping problem in Section 5. 

The critical to > 0 associated to the optimal constant C via (31) below also 
turns out to be of somewhat independent interest: if p denotes the first moment, 
say after t = 1, when t is bigger than cB*(t) 2 1 then E[p 5 ] is finite or infinite 
depending on whether c is smaller or bigger than to (Proposition 5.6 and 5.7). 
In Section 6 we state a pointwise version of the BDG inequalities and in Section 
7 we briefly discuss the case of general 0 < p < 2 without entering into a 
numerical analysis. Finally, in Section 8 we discuss the fact why the constant 
C = \/3 which was established by D. Burkholder [7] as the optimal constant for 
(1) in the case of martingales which are not necessarily continuous, is different 
from the present constant C = 1,27267... which holds true for continuous 
processes. We relate this discrepancy with a certain lack of concavity of the 
value function. 


2. The Value Function of an Optimal Stopping Problem 

Fix a constant C > 0. Following a well-known path in optimal control theory 
we define the value function 

V(t,b,b*):= sup E (t ’ MHS) [r5 -CB*(t)], 
reT(t) 


( 2 ) 
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where T{t) denotes the set of bounded stopping times r ^ t and E( t,h ’ & *) denotes 
the expectation conditionally on starting the Brownian motion B at time t with 
the values B t = b , Bf = b*. The domain of definition of V is 

D = {(t,b,b*): 0 < i < oo, 0 ^ |6| b* < oo}. (3) 

Equivalently we can write 

V(t, b , b *) := supEfVtTV - C(b* v (b + B(t))*], (4) 

reT 

which follows from the strong Markov property and stationarity of increments 
of Brownian motion. 

Denote by C the infimum of C > 0 such that (1) holds true. Clearly C still 
satisfies (1). If C < C then V(t,b,b*) = oo, otherwise we have: 

Lemma 2.1. Let C C then V defined via (2) is 

(i) continuous, 

(ii) finite-valued, and 

(in) 1 1 —> V(t, b, b*) — \ft is decreasing for fixed |6| ^ 6*. 

In particular (ii) follows from the bounds 

Vt + Cb* >V(t,b,b*)^y/t-Cb*. (5) 

Proof. The lower bound of V follows from choosing the stopping time t e T(t). 
For the upper bound observe that we can estimate for an arbitrary reT 

VtTfi - C{b * V {b + B(t))* ^Vt + Vr-C{b + B(t))* 

< Vt + ypr — CB(t)* + C\b\ 

^Vt + Cb* +s/t-CB(t)*. 

Taking expectations we get the upper bound for V from the representation in 
(!)• 

Next observe that for r e T we have for t < t' 

Vt+r — C{b* v (b + B(t))* — Vt < \/t' + t — C(b* v (b + B(t))* — Vtf 

by concavity of the square root. Now (iii) follows by taking expectations suprema. 
For (i), please refer to Sections 7 and 9.2 in [14]. □ 

To exclude the trivial case, we assume in the sequel that C ^ C. 

For fixed C, the stopping region S ^ D and the non-stopping region NS ^ D 
are defined by 

S:= {V = t^~ -Cb*}, NS := {V > t^ — Cb*}. (6) 

To characterize the stopping region S first note that it is certainly not a good 
idea to stop when \B(t)\ < B*(t). 
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Lemma 2.2. Let (t,b,b*) e D with \b\ < b*. Then (t,b,b*) e NS. 

Proof. Consider the first exit time of the interval [—6*, b*]. □ 

Next we observe a useful scaling property of V (compare Burkholder [7]). 
Lemma 2.3. For a > 0 and ( t , b , b *) e D, we have 

V{a\ab,ab*) = aV(t,b,b*). (7) 


Proof. This follows directly from the scaling property of Brownian motion: if 
(B t ) t j,o is a standard Brownian motion, then (a 1 B a 2 t ) t - s , 0 again is a standard 
Brownian motion. Also, a random time a 2 r is a stopping time for the first 
process if and only if r is a stopping time for the second process. fj 

This allows us to derive the following Lemma where the first part is a direct 
consequence of Lemmas 2.1 (iii) and 2.3 and the second part is technical and 
deferred to the appendix in Lemma B.3. 

Lemma 2.4. Let 0 < t ^ t' and b e R. Then ( t , b, |6|) e S implies (t r , b , |6|) e S. 

Hence, for fixed C 3 s C there is a smallest to e [0, °o] such that ( t,b , |6|) e S 
if and only if 


t 

w 


P to- 


( 8 ) 


In fact, we have to e (0, oo). 

The next result is a standard result in optimal control theory and also intu¬ 
itively rather obvious. Again, the proof is deferred to the appendix. 

Lemma 2.5. Suppose C IP C and let ( t,b,b *) be in the non-stop region NS. 
Consider a Brownian motion (B(u))t^ u starting at time t conditionally on 
B(t) = b and B*(t) = b*. Let r be the first hitting time of the stopping re¬ 
gion S, i.e. 


t = inf{it ^ t : (u,B(u),B*(u)) e S} (9) 

Then the value process stopped at time r 

M(u) T := V{u a t, B(u a r), B*(u a t)), ulPt, (10) 

is a martingale. 

The unstopped value process 

M(u) := V(u, B(u), B* (it)), u^t, (11) 

still is a supermartingale. 

We conclude this section with a minor technical remark. In the above state¬ 
ment, as well as in most of the paper, we follow the usual language of optimal 
control theory to condition on the event {B(t) = b , B*(t) = b*}. As this is a null 
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set under P this procedure needs some proper interpretation in order to make 
it rigorous. 

Let us now introduce some notation to make this a bit clearer. 

We denote by (J r (zi)) u> o the (right-continuous, saturated) filtration gener¬ 
ated by the Brownian motion (B(u)) u ^o- Of course, in definition (2) the stopping 
time r e T(t) is understood with respect to this filtration. But it is clear from 
the Markov property that, for fixed (t,b,b*), we may assume that r e T(t) 
depends only on the behavior of the Brownian motion (B(u)) u ^t after time t 
and not on the previous behavior of (B(u))o^ u ^t (except for the requirements 
B(t) = b and B*(t) = b*). 

To formalize this fact, we denote by the (right-continuous, satu¬ 

rated) filtration generated by ( B{u ) — B(t)) u ^ t . A stopping time r e T{t) (i.e., 
with respect to the filtration (J r (u))„^o) then may also be considered as a ran¬ 
domized stopping time with respect to the filtration (C/W the random¬ 

ization given by the trajectories of (B(u))o^ U! ^f As (f3(it))o^u^t is independent 
of the filtration (G^ (u)) u ^ t , we conclude that the value of (2) does not change 
whether we optimize over the randomized or the non-randomized stopping times 
with respect to the filtration (£/W For an introduction to the notion of 

randomized stopping times, please refer to [3] 

The bottom line of these considerations is that we may assume w.l.o.g. in (2) 
that r e T(t) is a stopping time with respect to the filtration {G^ (u)) u ^t- 

Now, the statement Lemma 2.5 could be rephrased without referring to con¬ 
ditioning on a null set, by noting that r is a stopping time with respect to the 
filtration (G^(u))t>su- 

All other statements in the paper referring to conditioning on the values B{t ) 
and B*(t) could be made rigorous in an analogous way if the reader insists, but 
we do not further elaborate on these technicalities. 

3. The Value Function from an Analytic Perspective 

Again fix C ^ C. Differentiating the scaling equation (7) with respect to a and 
setting a = 1 we obtain, at least formally, the PDE 

2 tV t + bV b + b*V b * = V. (12) 

The optimal constant C for inequality (1) will be determined by analyzing 
whether this PDE has a reasonable solution for given C > 0 or not. 

We need some preparation. For 0 < h < 1 we denote by f h {s) the density 
of the distribution of the stopping time p h = inf {t : \B(t)\ = 1}, where B is a 
Brownian motion starting at -B(O) = 1 — h. 

Define 

f h ( s ) 

g(s ) = lim — : -—, s > 0. 

’ h\o h 

It is well-known (e.g. [11, Exercise 2.2.8.11]) that there is an explicit repre¬ 
sentation of f h (s ) as an infinite sum. By differentiation of each summand we 
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obtain an explicit infinite sum representation also for g(s) (see the appendix 
below). 

The function g appears in the formulation of the subsequent lemma which 
will turn out to be of crucial relevance for our analysis. 

Lemma 3.1. Let W : D —* R + be a continuous function such that 


(a) b* i—> W(t,b,b*) is Lipschitz continuous and 

(b) IT(ft, 1,1 ) — y/t is decreasing. 

Furthermore let S <= D be defined by Sw '■= {(t, b, |6|) e D : t/b 2 ^ tw} 
for some fixed tw ■ Consider a standard Brownian Motion B (ft) and define tw 
to be the first hitting time of Sw Suppose that X(t) := W(t,B(t),B*(t)) is a 
supermartingale and X(tATw) is a martingale. Further assume that the process 
X(t a tw a a h ) is uniformly integrable where o h is given by <r h := inf{s ^ t : 
\B(s)\ = l + h}. 

Then, 

(i) 

W 6 *(ft,l,l) := lim \[W{t,l,l + h) -W(t,l,l)] = 0 (13) 

o h 

for 0 < t < tw 


(ii) 


W b (t, 1,1) := lim ~[W(t, 1,1) - W(t, l-h,l] 
h \o h 


(14) 


r 


[W{t + s, 1,1) -W(t, 1, l)].g(s)ds 


Observe that (i) in the above Lemma would follow directly from Ito’s formula 
if we assume that W is sufficiently differentiable by considering 


dX{t) = ( W t + \W bb )dt + W b dB{t) + W b *dB*(t) (15) 

which is the increment of a martingale. The process dB*(t) is non-decreasing 
and its variation is a.s. singular with respect to Lebesgue measure. A necessary 
condition for (W(t,B(f),B*(f))) Tw ^ t to be a martingale therefore is that W b * 
vanishes a.s. with respect to the variation measure of dB*. This indicates that 
W b *(t,b,b*) = 0 should hold true whenever |6| = b* and ( t,b,b*) is in the 
non-stop region NS. In particular, we should have W b *(t, 1,1) = 0, for t < tw- 


Proof of 3.1. (i) For ft > 0 as in (13) define, conditionally on B(t) = 1 and 
B*(t) = 1, the stopping times 

a h := inf{it ^ t : |-B(u)| = 1 + h} and (16) 

r h := o h a t W - (17) 


Recall that the random variable tw is a stopping time with respect to the 
filtration (G^ {u)) t ^ u - Note that the process ( u,B(u),B*(u )) u ^. t , starting at 
B{t) = 1 and B* (ft) = 1 + h, also remains in D\Sw up to the stopping time tw■ 
To see this, we distinguish two cases: 
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(i) t < u < r h : Here we have B(u) < 1 +h = B*(u) and thus (u, B{u), B*(u)) 6 

D\Sw- 

(ii) r h < u < Tw : In this case B*(u) is the same for B*(t) = 1 and B*(t) = 

1 + h. 

As t < to we have that r is a.s. strictly positive. This implies that 

lim P [a h = r h ] = 1, (18) 

h\fl 

as lim/jx^g a h = t, almost surely. 

We may write 

W(t, 1,1 + h) - W(t, 1,1) (19) 

= E^ 1 ’ 1 +h '>[W(T h ,B(T h ),B*{T h ))]-E^ t ’ 1 ’ 1 '>[W(T h ,B(T h ),B*{T h ))] 

= \W{r h , B(r h ), 1 + h)] - E (M ’i)[W(r h , B(r h ), B*(r h ))]. 

Here we use that B*(r h ) = 1 + h conditionally on B*(t) = 1 + h as T h < a h by 
definition. Furthermore we make use of the martingale property of W ( t , B(t ), B* (t'j) 
and the optional stopping theorem. For the use of the latter we need the as¬ 
sumption that (W(u a r h , B(u a r h ), B*(u a T h ))) u ^ t is uniformly integrable. 

On the set {a h = r h } we have B*(r h ) = B*(a h ) = 1 + h for both initial 
conditions B(t) = 1 ,B*(t) = 1 + h and B(t) = 1 ,B*(t) = 1. Therefore, the 
value W(r h , B(r h ), B*(r h )) is the same under both initial conditions. It follows 
that 


E (t,i,l+fc)[ W -( T /. j B ( T h ), 1 + h)l {ah=Th} ] 

~^ t ’ 1 ’ 1 '>[W(T h ,B(T h ),B*(T h ))l W u =T H } \ = 0. 

On the remaining set {r h < a h } we have B*(r h ) e [1,1 + h]. Because W is 
Lipschitz continuous in the variable b* with some constant L we may estimate 

E (M,1 +h) [W ( T h jB (T h ),l+h)l {T H <<Th} ] 

-E^ 1 ' 1 '>[W(T h ,B(T h ),B*(T h )l {T H <<J H } ] < LhP[r h < a’ 1 ]. 

Dividing (19) by h and passing to the limit we obtain from (18) that 

Wb*(t, 1,1) := lim i[W - (t,l,l + /0-W'(t,l,l)] =0 
o h 

(ii) As A' is a martingale before hitting Sw we have for p h as above that 

rCC 

W(t, l-h,l)= e^-M \W{p h , 1,1)] = W(t + s, 1,1 )f h (s)ds, 

Jo 

where the density f h is given by 
1 


roo == 


a/ 27rs 3 


s 


(An + h) z , _ x (An + 2-h) z 

(4 n + h)e 2 = + (4n + 2 — h)e 2 » 


■ ( 20 ) 
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We can use this relation to calculate the derivative w.r.t. the second component: 

1 f 00 

-W b (t, 1,1) := lim - [W(t + s, 1,1) - W(t, 1,1 )]f h {s)ds 
h\ o h J 0 

We split this integral into two parts at some point a > 0 and observe that 
f h (s ) is continuous and f h (s) \ 0 for h \ 0 pointwise at s > 0 and thus by 
Dini’s Theorem also uniformly (monotone) on any interval [o, K ], for 0 < h < 
ho(a). Therefore we have 


lim 

h\0 



+ s,l,l)-W(t,l,l)]f h (s)ds 


( 21 ) 



+ s, 1,1) - W{t, 1, l)].g(s)(is 


for g given by 


g(s) := hm ^ 
h\0 h 


\/2tts 3 


1 + 2- £(-i y 



( 22 ) 


As before, because u >-» W(u, 1,1) — \/u is decreasing and s h-» y/t + s — \Jt is 
concave, we have W(t + s, 1,1) — W(t, 1,1) < y/t + s — yft < Therefore 

sf h (s) 

the integrand is dominated by . 

For K > 0 we can then estimate 


1 

h 



jV[p h > K]E[(B ph - B 0 ) 2 \p h > I<] 


= P [p h > K](2 - h) ^ 2 P[p h > K). 


This probability tends to 0 uniformly in K , thus the integrals over [I\ , oo) can 
be neglected and we can replace K by oo in (21). 

For the other part of the integral we first observe that 

lim ^ f sf h (s)ds 

h\ o h J 0 w 

If 00 If 00 

= lim — sf h (s)ds— — sf h (s)ds 
h\o h Jo W hj a W 

1 If” 

= lim —— — sf h (s)ds 

h \o h 1 J h J a v ’ 

i pGC /'GO 

= lim 2 — — — sf h (s)ds = 2 — sg(s)ds. 

h J a J a 

The last integral converges to 2 for a —» 0 by monotone convergence. 

We conclude by setting M ^ \Wt{u, 1,1)| for u e [t ,t + a], making the 
estimate 
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1 

h 



[W(t + s,l,l) -W(t,l,l))f h (s)ds 



and then taking the limit for a —> 0. 


a. 


We can now apply this technical Lemma to our value function V by checking 
that the assumptions of the previous Lemma are satisfied by the value function 
V: 


Lemma 3.2. Let V : D — > K. be the value function for (2). Then, 




(ii) 


V b *(t, 1,1) := lim l[V{t, 1,1 + h) - V(t, 1,1)] = 0 (23) 

h\0 ri 

for 0 < t < to 


V b (t, 1,1) := lim ~[V(t, 1,1) - V(t, l-h,l\ 

h \o h 


rOD 

[V{t + s, 1,1) - V{t, 1, l)]g(s)ds 
Jo 


(24) 


Proof. V is continuous by Lemma 2.1 and Lipschitz-continuous in b* by defi¬ 
nition. We also have V(t, 1,1) — \ft is decreasing by Lemma 2.1. Furthermore 
V(t,B(t),B*(t)) is a martingale up to hitting S by Lemma 2.5. Thus, setting 
W = V, Sw = S and tw = to, it remains to check the required uniform inte- 
grability condition. We have 


0 HV(u a r\ B(u a T h ),B*(u a r h )) 

^ V(t, B{u A T h ),B*{u A T h )) + (VU A T h — yft) 

K:V(t,Q,l + h) + Vr'* < V(t, 0,1 + h) + ^ + r h . 


where the first estimate follows because the function u i-» V(u,b,b*) — y/u is 
decreasing in u. The second inequality is due to the fact that V is decreasing in 
|fe| and increasing in b* as well as B*(u a r h ) < 1 + h. Now, r h ^ a h and a h 
has exponential moments and is therefore integrable which yields the desired 
uniform integrability. 

Observe that r h is smaller than the first hitting time of the non-stop region 
NS no matter whether we condition on (t, 1,1) or (t, 1,1 + h ) which warrants 
the use of Lemma 2.5. □ 


The subsequent lemma shows that, for t > to, the behavior of V b and V b * 
follows a different pattern than the one given by Lemma 3.2. We find that 

V b *(t,l,l) + V b (t,l,l) = -C (25) 

where we have to interpret this equation properly. 
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Lemma 3.3. For t > t 0 we have 

lim — [V(t, 1 + h, 1 + h) — V(t, 1,1)1 = —C. 

h\o h 

Proof. For t > to we have 

V(t,b*,b*) = i* -Cb*, 


(26) 


for b* in a neighbourhood of 1. □ 

To abbreviate notation we shall sometimes denote by V(t) the function 
V(t, 1,1) (recall that we keep C S 5 C and the corresponding t 0 = to(C) fixed). 
We thus obtain the following integro-differential equation for V(t). 

Lemma 3.4. The function V(t) satisfies the following equations 

2 tV\t) = V(t) +C, t> t 0 , (27) 

rOD 

2 tV\t) = V(t) + [V{t + s) - V(t)]g(s)ds, t < t 0 . (28) 

Jo 

Proof. The first assertion is obvious, as we have 

V(t)=t?—C, for t > to. (29) 

The second equation follows, at least formally, from (12), (23) and (24). 

To justify (28) in a more pedantic way, note that for a > 1 we obtain from 

(7) 


aV(t, 1,1) - V(t, 1,1) = V{a 2 t, a, a) - V(t, 1,1) 

= (V ( a 2 t , a, a) — V ( t , a, a)) 

+ (V(t,a,a) - V(t,l,a)) 

+ (V(t,l,a)-V(t,l,l)). 

Dividing by a — 1 and letting a decrease to 1, we deduce (28) from Lemma 
3.2. □ 

Let us discuss the behaviour of the function V(t) at t = to- As observed in the 
previous section, V(t) is continuous so that we must have “continuous pasting” 
at to- It is the immediate reflex - at least it was so for the present authors - 
to expect smooth pasting of V(t) at t = t 0 i.e. lim^t,, V'(t) = lim ty to V'(t). 
By (27) and (28) this would result in determining to by equating C with 
^(V(t 0 + s) — V(t 0 ))g(s)ds. To our big surprise this turned out not to be 
the case; after some time of reconsidering we had to conclude that there is little 
reason why the smooth pasting principle should prevail in the present context. 
Here is one intuitive reason: for a fixed number to > 0 we have that for almost all 
trajectories of a Brownian motion B = (B(t)) t ^ 0 , starting at B( 0) = 0, there is 
no t > 0 such that the two equalities |i?(£)| = B*(t) = (^) 5 are simultaneously 
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verified. By Lemma 2.4 we conclude that a discontinuity of the derivatives of 
V(t,B(t),B*(t)) can only take place where these two equations are simultane¬ 
ously satisfied. Roughly speaking: the Brownian motion B “does not see” a kink 
of the function V(t) at t = to- 

As a matter of fact, this natural example of a case of non-smooth past¬ 
ing in the case of continuous martingales seems to us a remarkable feature 
of the present paper. The literature on non-smooth pasting is generally revolv¬ 
ing around non-continuous processes. Some examples of non-smooth pasting for 
processes with jumps can be found in [1], [2], [6], [9] and [14]. 


4. The Integro-Differential-Equation 


Fix the parameters O > 0 and to > 0. We consider the ordinary integro- 
differential equation for the function U = U c,t ° 

U(t)=t%—C, t^to, (30) 

rCC 

2tU'[t) = U{t) + [U(t + s) — U(t)]g(s)ds, 0 <t^t 0 , (31) 

Jo 


where g is given by (22). 

Here the fixed behaviour (30) of U(t), for t ^ to, is considered as the initial 
condition, and subsequently the OIDE (ordinary integro-differential equation) 
(31) is solved by letting t decrease from t 0 to 0. For t = t 0 , the derivative U'(t 0 ) 
in (31) is understood as the left limit of U'(t) when t increases to to- 

It is standard to verify that, for C > 0, to > 0, and e > 0 the solution U C ’ to of 
(30) is well-defined for t e [e, go) and depends smoothly on the parameters O and 
to- On the other hand, the 2t term on the left hand side of (31) indicates that 
only for special cases of O and t 0 this solution can be extended to a continuous 
and finitely valued function U(t), defined for all t e [0, oo). 

The evidence resulting from our numerical analysis of the solutions (U c,t ° (t))t>o, 
in dependence of O and to, can be resumed as follows: 


Numerical Evidence 4.1. 


(i) 


(ii) 

(iii) 


There is a smallest number O > 0 as well as a unique number to > 0 
such that the OIDE (31) admits a solution U c,t °(t) which has a finite 
limit lim t _>o U C ’ to (t). This solution is monotone increasing and U C ' to (t ) ^ 

t§-<5. 

For 0 < C < C there does not exist to > 0 such that U(t) = U C ’ to (t) 
remains bounded from below as t \ 0. 

For C > C there are precisely two values t\ = ti(C),t 2 = ti{C) depending 
on C in a continuous and one-to-one way such that U c,ti ( t ) has a finite 
limit, as t \ 0, for i = 1,2. These solutions satisfy U c,ti (t) ^ t? ~ C. 

For t-o e (t\,t 2 ), the solutions of the OIDE (31) tend to +oo, for t \ 0, 
while, for t-o f the solutions tend to —go, for t \ 0. We have 

ti(C) <t 0 < O(O) and lim c _ > gti(0) = limO(O) = t 0 . 
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The functions U c,tl and U c,t2 are monotone increasing and 

U c ’ u (t) < U d ’ Io (t ), for all t e [0,oo[. (32) 

Finally, we find the numerical values 

C * 1.27267... and t 0 * 0.9036... (33) 

We have not been able to provide a mathematically rigorous proof of the 
above assertions and only rely on the numerical evidence (which is based on 
Euler-type simulations in Python with variable step sizes). We therefore consider 
the above statements rather as hypotheses underlying our subsequent results 
and we shall carefully point out in the subsequent statements where we rely on 
this evidence. 

For example, for C = 1.25 < C which is case (ii) above we illustrate the 
situation by Figure 1. 



Fig 1. The subcritical case C < C: Numerical solutions for C = 1.25 and various values for 
the pasting position to in the interval [0.8,1]. The graph underneath is t 1//2 — C through which 
all solutions cut in the subcritical case when they get close to 0. 


For C = 1.274 > C, which is case (iii) above, we find t\ * 0.85... and 
t 2 * 0.95 ..., as illustrated in Figure 2. 


When C decreases to the critical value C * 1.27267... the numerics suggest 
that the length t 2 — 1 1 of the intervals (ti,t 2 ) decreases to zero and that these 
intervals shrink to a single point to e]0,oo[ for which we find to ^ 0.9036.... 
It is convincing from the numerics that the limiting solution U c,t °(t) then is 
well-defined for all t ^ 0 by letting 0) := lim^o (t). This function 
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Fig 2. The supercritical case C > C: Numerical solutions for C = 1.274 and and various 
values for the pasting position to in the interval [0.6, 1.2]. We find that while solutions for 
to in an interval (ti,i2) with ti 0.85 and t 2 as 0.95 stay above the graph oft 1 / 2 — C (in 
fact, they tend to +oo, as t —> 0), the solutions fall to — oo for t —> 0 if to $ [ti, ^ 2 ]- At the 
transition of these two regimes lie the bounded solutions and U c,t2 (which are not 

explicitly displayed in the above figure but are squeezed between the neighbouring solutions 
U c ^). 


U C ’ to (t) is monotone increasing and such that U c,t °(t) lies above the function 
U^ ,0 (t) = t 2 — C. Clearly we expect that U C '*' a must be the “right” solution, 
which may be identified with the value function V defined in (2) for the optimal 
constant C = C and, in particular, that C as 1.27267 ... equals the optimal con¬ 
stant C in the Burkholder-Davis-Gundy inequality (1). We shall subsequently 
deduce this result more formally. 

5. Identifying the Value-Function 

Admitting the Numerical Evidence 4.1 we shall show that the function U c,t °(t ), 
obtained above from the analysis of the OIDE (31), indeed determines the value 
function V(t,b,b*) as defined in (2) for the constant C = C and that this 
constant is indeed the optimal Burkholder-Davis-Gundy constant in inequality 

(I)- 

Starting from a solution U(t) = U C ’ to (t) of the OIDE (31) for parameters 
C > 0 and to > 0 such that U{t) extends continuously to a finite value 17(0) we 
may extend this solution (by slight abuse of notation) to a function U(t,b,b*), 
defined on D, by first letting 

rOD 

U(t,b, 1) := U(t + s)f 1 ~^(s)ds, for 0 < |6| < 1, 

Jo 


( 34 ) 
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where f h (s) is defined in Section 3. For general (t, b, b*) e D we use (7) to define 

U(t,b,b*) = b*u(j±p,± l). (35) 

For later reference, we note that U(t) ^ yft — C implies 

U(t,b,b*)>Vt-Cb*, for all (t,b,b*) e D (36) 

Lemma 5.1. Fix C > 0 and to > 0 such that U(t) extends continuously to a 
finite U{ 0) and admit the Numerical Evidence f.l (i) and (in). Let (B(u)) u ^t 
be a Brownian motion starting at some time t > 0 at B(t) = b and B*(t) = b*. 
The process U(u,B(u),B*(u)) u ^t is then a local super-martingale. It is a local 
martingale up to entering the stopping area S := {(t,b,b*) : |6| = b*,t/(b*) 2 ^ 
to}- 

For the proof we need the following Lemma to justify the use of Ito’s formula 
for a function that is not smooth everywhere but where the Brownian Motion 
hardly ever touches the set where it is not differentiable. 

Lemma 5.2. Let W : D —» K. be a continuous function and tw > 0 such that 

(a) the derivatives W t and W bb exist and are continuous on the interior of D, 
and 

(b) W t + \W bb = 0, 

(c) W b * ^ 0, and 

(d) W b * (t, b, \b\) = 0 for £ < t w . 

Define Sw '■= {(L b, |6|) eD:p^ t\y}- For a standard Brownian Motion B(t) 
let tw be the first hitting time of S for (t,B(t),B*(t)). Then, 

(i) X'(f) := W(t,B(t),B*(t)) is a local supermartingale, and 
(ii) X(t a tw) is a local martingale. 

Proof. This follows at least formally from the assumptions and Ito’s formula 

dX{t) = (W t + \W bb )dt + W b dB(t) + W b *dB*(t). (37) 

To address this in a more formal way, let e > 0 and define the stopping times 
{Pn)n= o by pi = 0 and 

Pn = inf{t : t ^ Pn-i + e and \ B (t)\ = B*(t)}. 

We also denote by A e the union U™=oI Pn+fij which is a predictable subset 
of Ll x M + . Denoting by ^4° = Dn=o^"> ^be set A 0 simply equals {\B(t)\ = 
B*(t)}. Fixing T > 0, the Lebesgue-measure of {w} x [0,T] n A e tends to zero, 
for almost all ui e D. 

Fix a bounded stopping time r such that (B(t))o^t^r remains bounded. It 
follows that W t as well as W bb also remain bounded on A e n [0, r] so that 
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is a martingale and B e = (fl x R + )\A e is the complement of A e . Indeed, it 
suffices to reason on the stochastic intervals \p e n _\ + e, p%\ and to observe that 
B*(t ) remains constant on these intervals. 

Turning to the remaining part 


Y\t) : = f 1 A 'dX(8) = X(t) - X e {t) 

Jo 


Jo 

we shall show that along a sequence these processes tend almost surely to the 
non-increasing process 

F°(t) = r 


(t) = f l A o dX(s) 

Jo 


= f 1 A oW b *(t,B(t),B*(t))dB*(t). 

Jo 


Indeed, the dominated convergence theorem for Ito-Integrals yields convergence 
in probability and thus subsequence convergence almost surely. Fixing this se¬ 
quence of e’s we can take the process to the appropriate limit. □ 

Proof of Lemma 5.1. It follows from definition (34) that, for (t,b,b*) e D such 
that t > 0 and 0 < |6| < 6*, the heat equation 


U t + -U bb = 0 

is satisfied. 

On the boundary, for t < to, we can apply the definition of U to obtain 

i7 6 (t,l,l) + J7 6 *(t,l,l) := lim \[U{t, 1 + h, 1 + h) ~ U(t, 1,1)] 

o h 

= lim + hf) - U(t)] + U(t/(1 + h ) 2 ) 

h\0 (1 

= -2 tU\t) + U(t) 


= - f [U(t + s)~ U(t)]g(s)ds 

Jo 


The last equality is exactly the OIDE (31). Applying Lemma 3.1 (ii) one obtains 
that the last expression is equal to 

U b (t, 1,1) := - lim \[U{t, 1 - h, 1) - U(t, 1,1)], 

h\0 fl 

where all the assumptions of this Lemma are easily checked. Clearly the left and 
right derivatives of U b agree. It follows that U b *(t, 1,1) =0 and more generally 
that U b *(t,b, |6|) = 0 for t/b 2 < to. For t ^ to we can derive as in Lemma 3.3 
that 

r GO 

Ut* (t, 1,1) = — Ub(t , 1,1) — C = [(t + 5 ) i — t?]g(s)ds — C. 

Jo 
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This expression is monotone decreasing in and is necessarily non-positive at to 
so that U(t, 1,1) 12 — C holds. We conclude that, for arbitrary t > 0, t4* 0. 

Having established that 14* < 0 and Ub*(t,b, |6|) = 0 for p < f 0 we nray 
derive, at least formally, the assertion of the present lemma from (57) and Ito’s 
formula as in (15). 

Now, we can conclude using Lemma 5.2 □ 

Let us now observe the following relations between value functions to the 
optimal stopping problem and solutions to the OIDE (31). 

Lemma 5.3. Let V c be the value function as defined in (2) for a constant 
C > 0 that satisfies the inequality (1). Take to = ^o(C) e (0, oo) to be the 
corresponding point separating S from NS (see (6 )). Then V c (t) := V c (t, 1,1) 
satisfies the OIDE (31) for this choice of C and to. 

Proof. For t ^ to, we have V c (t) = ti - C. 

For t < to denote by r h = a h a t the stopping time as in (17) above, condi¬ 
tionally on (t, 1,1). Note that (V c (u,B(u),B*(u))) t ^ u ^ T h then is a uniformly 
integrable martingale. To see this, we make a distinction for u =% v and u > v 
where v is the stopping time v := inf{s : B*(s) 2 to < s}: 

1. u ^ v h := v a T h \ The domain of (( u , B(u),B*(u)) is bounded by |H(u)| ^ 
B*{u) < 1 + h and u < v h < B*[v h ) 2 t 0 < (1 + h) 2 t 0 . Clearly V c is 
bounded on this domain. 

2. u > v h '. Here the properties of V c given in Lemma 2.5 allow us to see 
that one can rewrite the process as 

V c {u,B(u),B*(u)) = e-CH*( u) 

where a is again the first time where B(u) = B* ( u ) holds. Note that this 
holds because B*(v) is now constant for u < v < cr, and it is clearly the 
definition of a uniformly integrable martingale, provided the conditional 
expectation is well defined, which it is by the estimate 

e (u,b(u),b*(«))^Jj ^ u \ + B *( u y 

Hence the formula 

V c (t, 1,1) = E(*d, P[V c (r h ,B(T h ),B*{T h )] 

is justified. This was the only assumption of Lemma 3.1 which is not immediate 
from the definition of V c . Now the claim follows from Lemma 3.1. □ 

Proposition 5.4. Admitting the Numerical Evidence f.l (i) and (ii), the con¬ 
stant C obtained in (33) equals the optimal constant C for (1), and the function 
U c,t °(t,b,b*) obtained in (34) and (35) equals the value function V(t,b,b*) as 
defined in (2) for the constant C = C. 

The value to associated to C by Lemma 2.f equals the constant to in (33). 
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Proof. To show C ^ C, suppose that C > 0 is a constant satisfying the 
Burkholder-Davis-Gundy inequality (1), i.e. suppose that C ^ C. Then by 
Lemma 5.3 we have that V c (t) := V c {t,\,l) satifies the OIDE (31) for this 
choice of C and the corresponding t 0 {C) separating S from NS. 

As V c (t) is increasing in t and satisfies 1^(0) ^ —C we conclude from the 
Numerical Evidence 4-1 (i) and (ii) that C ^ C. This yields C 3 s C. 

To show conversely that C < C consider the function U ct °(t,b,b*). By 
Lemma 5.1 the process (U c,to (t,B(t),B*(t)) t ^ 0 is a local supermartingale. 
Hence we have, conditionally on ( t,b,b *) e D and for each bounded stopping 
time r 3 s t and localizing sequence (r„)® =1 . 

U d ~ to (t,b,b*) 3s E A Tni B{r a T n ), B*(t a t„))] 

> E^’ b ' b *^[(T a r n )3 - CB*(t a t„)] 

where the second inequality derives from (36). In the limit for n —» oo this 
yields U d ’*°(t,b,b*) ^ E(bM*)[ T § - CB*(t)]. Hence U d ’*°(t,b,b*) dominates 
the value function V c {t,b 1 6*) as defined in (2) for the constant C. This shows 
C 3* C as well as U d ’ io ^ V d = V d . O 

We can finally summarize these results to proof the main theorem. 

Proof of Theorem 1.2. Suppose that C = C is the optimal constant for (1) 
and let to e (0, oo) the corresponding critical value given by Lemma 2.4. Then 
V = V{t) satisfies the OIDE (30) and this solution is increasing in t and satisfies 
V(0) Ss -C. 

As shown in the previous section there is a minimal C allowing for such a 
solution, for an appropriately chosen t 0 e (0, oo). This value of C therefore must 
coincide with the optimal value C for the Burkholder-Davis-Gundy inequality 
(!)■ 

Conversely if C is chosen such that for some t 0 e (0, oo) the OIDE (30),(31) 
has a solution on [0,T] then the numerical evidence 4.1 (ii) gives that C ^ C = 
C and so (1) holds. □ 

Remark 5.5. It is interesting to consider, for a fixed constant C > C, the rela¬ 
tion between the value-function V c (t } b, b*) defined in (2) and the corresponding 
solutions of the OIDE (31). In this case the numerical evidence 4.1 (iii) indicates 
that there are two bounded solutions U c,tl {t) and U c,t2 (t). Which of the two 
is the “good one”, i.e. which one equals the value function V c (t, 1,1)? 

To answer this question, first note that, for C > C, we clearly have the 
nronotonicity relation V c (t, b, b*) < V c (t, b, b*). It is also easy to see that 
to{C) < to(C) = to = t 0 , where t 0 (C) is associated to the value function 
V c (t, b, b *) via Lemma 2.4. In other words, the stopping region S for the func¬ 
tion 1 2 — CB*(t) in (2) is bigger than the stopping region S for the function 
ti — CB*{t). 
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It follows from the numerical evidence that the value t\(C) for which we 
have t\(C) < t 0 is the only candidate for the “good” solution while for t 2 (C) 
for which we have t- 2 {C) > to, we cannot have U c,t ^ c \t) = V c (t, 1,1). We can 
conclude from Lemma 5.3 that the value function V c (t, 1,1) indeed equals the 
solution U c ^ c \t) of the OIDE (31). 

The fact that U c,t ^ G \t) cannot be the “good” solution has the following 
consequence which is interesting in its own right (compare [15]). 


Proposition 5.6. Admitting the Numerical Evidence 4-1 (Hi) we 
t 2 > to-, the stopping time 

have that, for 


(38) 

satisfies 


E[p 5 ] = 00. 

(39) 

Proof. Define the stopping time r by 


r mf js 3s 1 : B *^ 2 > *2 and \B(s)\ - 5*(s) j 

(40) 

Clearly r ^ p, as we may equivalently define 


r := inf{s ^ p : B(s) = B*(s)}. 

(41) 


We claim that E[ps] < oo if and only if E[ts] < oo. Indeed, it follows from (41) 
that the law of r — p, conditionally on (p, B(p ), B* (p)), is that of the first hitting 
time a of the level B* (p) by the absolute value of a Brownian motion (W(u)) u ^o 
starting at W(0) = B(p). We may (very crudely) estimate E[cr] < B*(p) 2 . 
Noting that at time p we have B*(p) 2 ^ ^ we may estimate 

E[r-p|p,B(p),B*(p)]<f. 

Hence we obtain 

E[r*] = E[p5] +E[E[r5 - pi\p,B(p),B*(p)]] 

< E[pi] + E[E[(r - p) 5 |p, B{p),B* (p)]] 

^ E[ps] + E[E[r — p|p, B(p),B*(p)]i] 

4 + w) E[A 

which readily shows that E[ps] < 00 . This implies that E[rs] < 00 . 

So let us suppose that E[t5 ] < 00 and work towards a contradiction. 

Define the stopping region S(t 2 ) relative to t 2 as 

S(t 2 ) = {(t,b,\b\) e D : 
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and the corresponding non-stopping region by NS(t 2 ) = D\S(t 2 ). 

We condition on some fixed (1, 6, 6*) e NS(t 2 ). Note that r is the first time 
when leaves NS(t 2 ). 

Admitting the Numerical Evidence 4.1 (iii), associate to t 2 > to the constant 
C > C such that U c,t2 {t) is a solution of the OIDE (31) which remains bounded 
as t \ 0. We write U C ’ t2 (t,b,b*) for its extension defined in (35). In contrast, 
we denote by V c (t, 6,6*) the value function as defined in (2) for the constant 
C. 

The process (U C ’ t2 (t,B(t),B*(t)))i^ t ^T is a local martingale by Lemma 5.1, 
where the present t 2 corresponds to to in the statement of this lemma. 

In addition we show that this local martingale is a uniformly integrable mar¬ 
tingale up to time r, i.e. the family of random variables U C ’ t2 (a,B(a),B*(a)), 
where a ranges in the stopping times 1 a r, is uniformly integrable. Recall 
the scaling relation 

and note that B ^ a y remains in the interval [0, ^ 2 ] so that, by compactness, 

the term \U C ’ t2 ( B * a ^ 2 , > 1)1 remains bounded by some constant M > 0. 

Therefore 

\U C ’ t2 (a,B(a),B*(a))\ < MB*(a) MB*(t). 

If E[t5] < 00 we infer from the Burkholder-Davis-Gundy inequality (this time 
the reverse inequality to (1)) that the random variable B*(t) is integrable. 
Hence the family of random variables U c,t2 {a,B(a) 1 B*{a)) is dominated by 
the integrable random variable MB*(t) which shows that the local martin¬ 
gale U C ’ t2 (t, B(t), B*(t))i^ t ^r is of class D and is thus a uniformly integrable 
martingale. 

Hence, conditionally on (1, b, 6*) e NS(t 2 ) we obtain 

U c ’ t2 {l,b,b*) = E^ 1,6,6 *^!/ 0 ’* 2 (t, B(t), B*(t))]. (42) 

We now pass to the process (V c (t, B(t), B* (t)) t ^ 1 again conditionally on 
(1,5,6*) e NS(t 2 ). By Lemma 2.5 we know that this process is a supermartin¬ 
gale. Repeating the above argument, we obtain that this supermartingale is 
uniformly integrable up to time r. Hence 

H c (l,6,6*) ^ E (1 ’ h ’ b * ) [E c (r, B(t),B*(t))]. (43) 

Noting that at time r we arrived in the stopping region S(t 2 ) we obtain 

U c,t2 (t,B(t),B*(t)) = R c (t,R(t),R*(t)) = t* - CB*(t). 

Hence (42) and (43) yield 

V c {l,b,b*) U c ’ t2 (l,b,b*), 

for all (1,6,6*) e D. As we have seen that V c = U c ' t ^ < U c ^ c \ and 
U°MC) i s no t equal to U c,t2 ^ c \ we arrive at the desired contradiction. □ 
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The above result is complemented by the following estimate in the reverse 
direction. 

Proposition 5.7. Adtnitting the Numerical Evidence 4^.1 (Hi), we have, for 
ti < to, that the stopping time 


satisfies 

E[p5] < oo. (45) 

Proof. Similarly as in the proof of the previous proposition, we define 
r := inf{s ^ 1 : ^ and l 5 ( s )l = 

We shall show that 

E[t 5] < oo (46) 

which will imply (45). 

We condition on (1,6,6*) in the non-stop region NS(t\) defined as in the 
preceding proof, and will show 

E(i.fc.*>*)[ T 5] Kb* (47) 

for some constant K > 0, which will imply (46) by integrating over the values 
B( 1) = 6 and B*{ 1) = 6*. 

We associate to ti < to the corresponding C > C such that the solution 
U c,tl (t) of the OIDE (31) remains bounded (Numerical Evidence 4.1 (iii)). 

Using the Numerical Evidence 4.1 (iii) there is some a > 0 such that the 
solutions U c,tl (t) of the OIDE (31) and the value function V G (t) = U G, *°(t) 
for the optimal constant C = C are separated by some a > 0, i.e. 

V G (t) ^ U c,tl (f) + a, for all t ^ 0. (48) 

Indeed, for t ^ to we have U c,tl = ti—C and U G ’*° (t) = P — C. For t e [0, to] we 
have U c,t °(f) < U C ’ to (t) by (32) so that by compactness we obtain a separating 
constant a > 0. 

More generally, we obtain from (34) 

V G (t, 6,1) ^ U c,tl (t, 6,1) + a, for all (f, 6, 1) e D. (49) 

Similarly as in the above proof we consider, conditionally on (1,6,6*), the 
processes 

and (V G (u, B(u), B*(u)))i^ r . 
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Both are local martingales up to time r. Let (r n )“ =1 be a sequence of local¬ 
izing, bounded stopping times, r n ^ 1, increasing to r. 


v d (l ,b,b*) = E^ 1,6 ’ 6 *) \V d {T n , B(r n ), B*(T n )] 




[b*(t u )V< 


B(t„) 


■’)] 


B* (r n ) 2 7 S*(r Tl ) 7 

[B*(r n )U c * +aE^ b ^[B*(r n )] 

= U CM {l,b,b*) + aE ( ~ 1 ’ b ’ b *' l [B*(T n )]. 

Hence, letting n . — » oo, for each (1, 6, 6*) 

V e (l,b,b*) - C7 c,tl (l,6,6*) 

Using the scaling relation again, we get 


V" 


c 


1 } -U' 


■CM 


(50) 


Xb*Y' b* ’’ 

Observe that we can also find a bound /? such that 

p^V d (t)^U GM (t), for all t^O 


as the difference equals C — C for t ^ to and is bounded for the compact interval 
[0, to]. This directly yields /? as a bound on V c (t , b , 1) — (t, b, 1) which shows 

that the left hand side of (50) remains uniformly bounded. This yields (47) with 
K = 0/a and finishes the proof. □ 

Remark 5.8. As regards the limiting case when we define p in (44) by replacing 
t\ by the critical value to, we conjecture that we obtain E[p 3 ] = oo. But we were 
not able to prove this result. 


6. A pointwise version of one of Davis’ inequalities 

The value function V allows to derive a pointwise version of the Burkholder- 
Davis-Gundy inequality (1), which holds true in an almost sure sense rather 
than in expectation as stated in (1). This line of argument, inspired by the idea 
of robust superhedging from mathematical finance, is well-known (see e.g. [5] 
and [4]). 

Theorem 6.1. Denote by V = V c the value function (2) associated to the 
optimal constant C and consider the Brownian motion B = {B(t)) t ^o with its 
(right continuous, saturated) natural filtration (J-(t)) t ^o- 

There is a predictable process H(t ) satisfying E H 2 (t)dt\ < oo, for each 
T > 0, given a.s. by 

H(t) = V b (t,B(t),B*(t)) (51) 

for Lebesgue almost all t > 0, such that, for every bounded stopping time t, 

t5 - CB*(t) sS f H{t)dB{t ), a.s.. (52) 

Jo 
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Before giving the proof we observe the well-known fact that (52) trivially 
implies (2) by taking expectations on both sides of (52). 

Proof. Lemma 2.5 states that the continuous process 

X(t) = V(t,B(t),B*{t)) 

is a super-martingale, starting at A'(0) = V (0,0,0) = 0. sBy Doob-Meyer we 
may decompose X as 

X = M - A (53) 

where M is a continuous local martingale and A is a continuous non-decreasing 
predictable process, and M( 0) = A(0) = 0. 

In fact, M is a square integrable martingale as we will show in Lemma A.l 
in the appendix. 

By martingale representation we may find a predictable process H with 
H(t) 2 dt ] < oo, for each T > 0 such that 

M(t) = f H{u)dB(u). 

Jo 

By applying Ito to both sides of (53) we obtain the relation (51) which 
must hold true, for P-almost each u> and for Lebesgue almost all t (the null set 
depending on w). The formal application of Ito’s formula can be justified using 
the result in Lemma 5.2. □ 


7. BDG-Inequalities for general 0 < p < 2 

The above procedure can be easily modified to obtain similar results for the 
inequalities E[r§] ^ C p E[(B* (r)) p ] with 0 < p < 2. Lemma 2.2 and 2.4 stay 
essentially the same, with a different scaling given by 

V(a 2 t,ab,ab*) = aFVit.bX). 


This leads to the PDE 

2 tV t + bV b + b*V b * = P V 

and the OIDE 

rCO 

2 tVt(t) = pV(t) + [V(t + s) — V r (t)] 5 (s)ds 

Jo 


for 0 ^ t ^ to an< J the starting condition V(t) = t 'i — C for t^t 0 . 

In principle a similar analysis as in the present paper should provide explicit 
numerical values C(p) and to(p), in dependence of 0 < p < 2. We leave this task 
to future research. 

On the other hand, for p > 2 the present method does not seem to apply and 
some new idea is needed. 
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8. Relation to the Burkholder constant C = \/3 

In this section we consider martingales also allowing for jumps and we focus 
(wd.o.g.) on martingales (M„)£L 0 defined on a finite probability space fl (see 
Lemma 8.2 below). The BDG inequality (1) reads in this context as 

E[[M,M] h N ]^CE[M*], (54) 

where [M,M] n = 2" = i(Mj — ALj-i) 2 denotes the quadratic variation process. 
It was shown by D. Burkholder [7] that in this context the sharp constant C 
equals C = y3- 

One may ask for a deeper reason why we obtain a different sharp constant in 
(1) for continuous martingales as for martingales also having jumps. One reason 
is that the value function V fails to have a certain concavity property. 

Fix a point d = (t,b,b*) e D as well as a > 0,/3 > 0. Define the points 
d a , dp e D by 

d a = (t + a 2 ,b + a, max(6*, |b + a|)), 
dp = (t + /3 2 ,5-/3,max(6*, \b — /3|)). 

We also define p = and q = . 

Proposition 8.1. There exist x e D as well as a > 0, /? > 0 such that 

V(d) < pV(da) + qV(dp). (55) 

Of course, we could verify the above proposition in a trivial way by numeri¬ 
cally analyzing the function V ( t , b , b*) and detecting explicitly some d , a and /3. 
It is also clear where we should search for such a “bad” triple (x,a,{3), namely 
in a neighborhood of the “kink” related to the “non-smooth pasting” (Figure 1 
and 2) which displays a strong form of non-concavity. 

But this is not our point. The purpose of the above statement is to show 
how the non-concavity (55) of the value function V is related to the difference 
between the case of continuous martingales and the case of martingales with 
jumps. 

Also note that the equations V t + = 0 in the interior of D and V b * = 0 

on the non-stopping boundary of D (i.e. (23) and (15) above), imply that in 
the (properly interpreted) case of infinitesimal increments a and /3 we do have 
a in (55) above. This is the message of Lemma 2.5. 

Proof of Proposition 8.1. Admitting the subsequent lemma, we consider a dyadic 
martingale (M n )^ =0 starting at M 0 = 0. 

Let us fix some notation: The underlying probability space is given by 

Q = {(wi,..., wjv) '■ oj n £ { — 1,1}} 

and the filtration (P n )^ =0 is given by JF n = cr(u>i,... ,oj n ). Consider the process 


(*x =1 


(V([M,M\ n , M n ,M*))™ = o 
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where V = V c is the value function (2) associated to the optimal constant 
C 1, 27267 ... for continuous processes. 

It may happen that (X n )^ =0 is a super-martingale. In this case 

0 = X 0 ^ E [V{[M, M] n , M n , M*)] 

so that we obtain the inequality 

E[[M,M]|] < CE[M*]. 

However, we know that C ss 1,27267... is smaller than the sharp constant 
C = y/Z for martingales with jumps so that there must exist some dyadic 
martingale (M n )^ =1 such that the corresponding process (X n )^ =l fails to be 
a supermartingale. 

This means that there is some 0 ^ n ^ N — 1 and a/"' = (aq,... ,oj n ) such 
that - with slight abuse of notation - we find 

d := ([M,lH] B (uW),Af B (uW) 1 Jf n *(uW)) 


as well as 

d a = ([M,M]„ +1 ( W W,1),M„ +1 (J”),1),M: +1 (^,1)), 
dp = ([M,M]„ +1 M"),-1),M„ +1 ( W ("),-1),M* +1 ( W (”),-1)). 

such that inequality (55) holds true. □ 

For the following Lemma recall that a martingale is dyadic if the increment 
M n+ 1 — M n can attain at most two values, conditionally on a(Mi, ..., M n ),. 

Lemma 8.2. For a constant C > 0 the following are equivalent: 

(i) Every dyadic martingale (M n )^ =0 satisfies (54). 

(ii) Every martingale (M n )(( =0 defined on a finite probability space satisfies 
(54). 

(Hi) Every L 2 bounded martingale (M t ) o^t^T satisfies (54). 

Proof. The equivalence (ii) <=> (iii) is standard but for the convenience of the 
reader we will recall the argument for the non-trivial implication (ii) => (iii). 

First, we can reduce the problem to discrete L 2 -martingales: Fix an L 2 - 
bounded martingale M = (M t ) 0 ^t<i, based on a filtered probability space 
(fl, J 7 , (^ r t)o^tsST,P) and consider the martingales (M fe2 -n)^ =0 for n e N. If they 
fulfill (54), then letting n —» oo yields that M satisfies (54). 

Now, fix an L 2 -bounded martingale M = (M n ) 0 s£n^iv on a filtered probability 
space (fi, J 7 , (J- n )o<in^N,P) ■ Consider the net of finite subfiltrations (Xnfo^n^N 
of this filtration and their associated martingales = (M%)(( =0 (i.e. = 

E[M n \Q n ]). By (ii), every satisfies (54). The L 2 limit of is M and (iii) 
follows. 
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The implication (ii) => (i) is trivial. 

To show (i) => (ii) first observe that without loss of generality we can assume 
Mq to be deterministic. First, we can translate the martingale such that it 
has mean 0. Second, if M 0 is random, then define a martingale with 

Mq := 0 and M[ := Mj_i for i > 0. Then [M',M']jv +i = [M,M]n and 
M'* — M* 

Now, suppose first that (M„)„ =0 is just a one step martingale on a finite 
probability space f 1 with Mo = 0. We then have that Mi is a finitely valued 
random variable with E[Mi] = Mq = 0. 

By possibly passing to a bigger (still finite) f 1 we may find a partition 
(Ai ,..., A p ) of O such that Mi takes at most 2 values on each Aj and 

E[Mil A J = 0, j = l,...,p. 

We now define a dyadic martingale (M ; )^ =0 by M 0 = 0 and 

M 0 - Mj -1 := j = 1,... ,p. 

Clearly the variables ([M, M]i,M*) and ([M, M] p ,M*) are equal in law. This 
shows that, at least for N = 1, we may associate to every finitely values mar¬ 
tingale (M„)* =0 a dyadic martingale (Mj)^ =0 such that (54) holds true for M 
if and only if it does so for M. 

It is rather obvious how to continue the above construction in an inductive 
way so that we may associate to each finitely valued martingale (M n )!^ =0 a 
dyadic martingale (Mj)^ =0 such that ([M, M]jv,Mj^) and ([M,M] p ,M*) are 
equal in law. This readily shows (i) => (ii). □ 


Appendix A: The Martingale Property of the Value Process and 
Square Integrability 

Proof of Lemma 2.5. We denote the function appearing on the right side of (5) 
by v(t, b*): 

v(t, b*) := fs — Cb*, t,b*> 0. 

For fixed ( t,b,b *) e D and t ^ T we denote by V T (t,b,b*) the value function 
defined similarly as in (2), but where we only allow for stopping times r e T(t) 
which are bounded by T. Clearly V T (t, b , b*) increases to V(t, 6, b*), as T —» oo, 
pointwise for ( t,b,b*) e D. Also note that V(T,b,b*) = v(T,b*). Fix ( t,b,b *) 
and a bounded stopping time t < a < r. We have to show that 

V(t,b,b*) =R {t ’ b ' b * ) [V(a,B(a),B*(a))] (56) 


By the monotone convergence theorem it will suffice to show that 
lim V T (t,b,b*)= lim ^ b ' b *\V T {a T 7 B(a T ) 1 B*(a T )] 


(57) 
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for <j t := a a t t where r T is the stopping time defined conditionally on (t, b, b*) 

by 

t t = inf{u ^ t : V T (u, B(u), B*(u)) = v(u , B*(u))}. 

We then have that r 1 is bounded by T and increases a.s. to r. The crucial 
property is 

V T {t , b, b*) = E( t ’ M *)[u(r T , B*{t t ))], 

and, more generally, for any stopping time t =% p < r T , 

V T { Pl B(p),B*(p)) = E[u(t t , B*{t t ))\T{ p)]. 

This classical result can be found in [13, Theorem 2.2]. Putting this together 
and taking p = a T , we obtain (57). 

The proof of the supermartingale property which still holds true, after time 
t is identical with an inequality instead of an equality. 

a 

We can even show that the value process is bounded in L 2 up to some fixed 
time T: 

Lemma A.l. The supermartingale (X(i))osgt^T given by 

X(t) := V(t,B(t),B*(t)) 

is uniformly bounded from above and bounded in L 2 . Furthermore the martingale 
component of its Doob-Meyer decomposition X(t) = M{t)~ A(t) is also bounded 
in L 2 and we obtain the following quantitative estimates for every stopping times 
cr with 0 ^ a < T: 

(i) X(a)^Ti, 

(ii) E[X(ct) 2 ] < KT, 

(Hi) E[M(cr) 2 ] ^ E[M(T) 2 ] < oo. 

Proof. We first observe that V(t, 6, \b\)—(ti—C\b\) ^ V(0,b,\b\)—(—C\b\) ^ C\b\ 
holds as a consequence of the proof of Lemma 2.4 which gives us the estimate 
V(t,b,\b\) < t? . Letting t —» 0 this also imples V(t, 0,0) ^ t 5 and we can show 
(i) by 

V{t, 6, &*) < V(t, 0 , &*) < V{t, 0,0) < ta . 

V is monotone increasing in t and monotone decreasing in \b\ and b*. So we can 
observe for the positive part of X that 

E[(A(a) + ) 2 ]^(l/(T,0,0) + ) 2 ^T. 

For the negative part X(t)_ we can use V(t,b,b*) ^ ti — Cb* ^ — Cb* and 
estimate 

E[(X(a)_) 2 ] < C 2 E[B* (T) 2 ]. 
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In summary we have 

E[X(cr) 2 ] T + C 2 E[B*(T) 2 ]. 

To show the last assertion, we now split X into a sum of bounded processes 
in the following way. Define the stopping times (cr„)® =0 by 

a n = M{t:\B(t)\=2 n } aT, 

and define the processes X n , obtained by starting X at time <t„_i and stopping 
it at time a n : 


X n (t) = = (x tAan 

Of course, we have X = Ym=i X n and the trajectories of (X„(t)) 0s gt^T are only 
different from zero on the set ({er„_i < T})® =1 . 

The probability of these events can be estimated by 

PK_! <T] < Cl e- C222 ", (58) 

for some constants c± = C\(T) and C 2 = c?(T). 

Using a classical inequality on uniformly bounded supermartingales (appar¬ 
ently due to P. Meyer [12]) we obtain that each M n is a square integrable 
martingale whose norm can be estimated by 

[|M n (T)|| 2 2(p) ^ c 3 2 C4 "P[cr ra _ 1 < T] (59) 

for some constants c 3 ,C 4 depending only on T. 

Combining (58) and (59), we deduce that 

00 

l|M(T)||| 2(P) = ^ l|Afn(T)||i a(P) <oo. 

n=l 


□ 

For the convenience of the reader we spell out the message of Meyer’s Theo¬ 
rem [12, Theorem 46] in the present context. 

Theorem A.2 (Meyer). Let X = (X(i))o^t^T be a uniformly bounded super¬ 
martingale 

IIAIloo^ sup ||X(t)||i,oo c < 00 . 

OsStsST 

Denoting by X = M—A its Doob-Meyer decomposition we get that M is a square 
integrable martingale whose norm can be estimated by ||M||| := ||M(T)|| 2 2 ^ 
18c 2 

Proof. By standard approximation results it will suffice to show the result for a 
super-martingale X = (X(n))^ =0 in finite discrete time. Note that in this case 
we have A(0) = 0 and 


A(n + 1) — A(n) = E [X(n + 1) — A'(n)|J r (n)], n = 1,..., N 
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so that 

||A^4(n)||x,oo < 2c. 

We may telescope A(N) = Ym=i(A(n) — A{n — 1)) to obtain 

A(N) 2 = 2 f (A(n) A(n - 1)) f] (A(j) - A(j 1)) 

n=1 j=n 

N 

= 2 (A(n) - A(n - 1))(A(7V) - A(n - 1)). 

n= 1 

By taking expectations we get 
N 

E[A(A) 2 ] = 2 ^ E[E[(A(n) - A{n - 1 )){A{N) - A(n - 1))| T{n - 1)]] 

n= 1 
N 

= 2 2 E[(A(n) - A(n - l))E[A(iV) - A(n - 1 )\T{n - 1)]]. 

n= 1 

The final term is uniformly bounded as 

E[A{N) - A(n - 1)| F(n - 1)] = E[X(N) - X{n - 1)| T(n - 1)] 2c. 

This yields 

N 

E[A(A) 2 ] s? 4c ^ E[A(n) - A(n - 1)] 

n= 1 

= 4cE[A(7V) - X(0)] ^ 8c 2 . 

To obtain a bound for ||M|| 2 we use the relation M = X + A and ||X(lV)||£<x) < c 
to get 

E[M(7V) 2 ] ^ 2E[A(1V) 2 ] + 2c 2 ^ 18c 2 . 


□ 


Appendix B: Some facts on the stopping time of first leaving a 
corridor 

We discuss the first exit time of the interval [—h, 2 + h] for some h > 0 for a 
standard Brownian motion B started at B(0) = 0. 

a h := inf{f : \B(t) — 1| = 1 + h}. 

This stopping time has a well-known density and a well-known Laplace-Transform 
C (see e.g. [11, Section 2.2.8.C] given by 

= cosh(V2fl) 
cosh((l + h)V2fl) 
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One can calculate the expected value of the stopping time by noting that 
= E[B^ h ] so that Efcr 71 ] = h{2 + h). The Jensen-inequality directly 
implies that the fractional moments of order less than 1 also exist and one 
has the estimate E[(<7 h )s] ^ (/i(2 + /i))s. This motivates to conjecture that 

^ 2 J —» oo for h —* 0. We can get an expression for this moment using the 
Laplace-transform above by the formula E[(o- h )s] = J® 6~^L\0) dO, which 
we can evaluate to 




f 

JO 


sinh (hu) 


u cosh((l + h)u)‘‘ 


du + h 


f 


cosh(u) tanh((l + h)u) 
u cosh((l + h)u) 


du 


where a substitution u = \f29 was used. It can be shown that both of these 
integrals are in fact finite for positive h, but we are only interested in the above 
limiting behavior of this expression. To see this first note that the integrands of 
both integrals are always positive. Furthermore the hyperbolic tangent converges 
to 1. Therefore we can fix a constant K such that for h < 1 we have that 
tanh((l + h)u) ^ \ for u ^ K. Putting this together we make the following 
estimate: 

E[(cr ,l )3] > 1 f 00 cosh(u) 

h ^ V27r Jk ucosh((l + h)u) 

l f® e~ hu , 1 f® e~ u , 

^ —-= - du = —- du. 

2V2tt J k u 2V27T J hK u 

The last expression now obviously diverges for h —> 0. The following Lemma 
which we will need later on uses the above observations: 

Lemma B.l. Let K > 0 be an arbitrary constant. There exist t, h > 0 such 
that 

E[(f + cr' 1 )! - is] > Kh. 

Proof. By the above observations we can choose h small enough to obtain 
EUV^s] > Kh+1. We also have E^)?] - E[(i + cr h )^ is] < 1 for small 
enough t by monotone convergence of (t + s)s — ii to ss for i —» 0. Q 

We can now proceed to show the following facts about to to prove the final 
statement of Lemma 2.4. 

Lemma B.2. Let C ^ C and V(t, b , b*) the corresponding value function. The 
map 1 1 —> V(t, 1,1) — (is — C) is decreasing and if V{1, 1,1) > 1 — C it follows 
that V(l, 1, 1) - (1 - C) < V(0,1, 1) - (-C). 

Proof. We need a quantitative version of Lemma 2.1 (ii) which already shows 
that f V(t, 1,1) — (is — C) is decreasing. First choose some v ^ 1 to be a 
bounded stopping time which achieves 

V(l, 1,1) - E (1,1,1 ^[iss - CB*[y )] < e 


( 60 ) 
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such that 

F[u > 1 + 2e] > (2e)5 (61) 

for arbitrary e > 0. It is clear by definition of V that there exists a stopping 
time which satisfies (60). Suppose there is no appropriate stopping time such 
that (61) is satisfied, then there is an optimizing sequence of bounded stopping 
times which converge to 1 in probability and thus a subsequence which converges 
almost surely. This would imply that 1/(1,1,1) = 1 — C which contradicts the 
assumptions of the Lemma. 

Now, we can consider the stopping time v as a randomized stopping time 
with respect to the filtration (Q^(u )) u ^The shifted stopping time v' := v— 1 
is then a randomized stopping time with respect to (G^ {u)) u ^ o- We can now 
estimate 

[V (0,1,1) — (—C)] — [V (1,1,1) — (1 - C)] 

E(°d,i)[y| _ CB*(v')} - E( 1 > 1 ’ 1) D'* ^ CB*(v)] + 1 - e 

= E[( l /)s-(l + I /)5 + l]-£ 

^E[((i/') l -(l + ^) l + l)^'>2 J-e 
^ [(2e)i - ((l + 2e)3 - l)](2e)* -e 
> £ - y/2e% > 0. 

To get from the second to the third line, we used that by definition v = v' + 1 
and El 0,1,1 )[.B*(i/)] = E( 1,1 ’ 1 )[U*(^)]. We dropped the superscript to empha¬ 
size that we now view v' as a stopping time with respect to the filtration 
(t/f°)(it))u>o- To obtain the fourth and the fifth line in the derivation, we observe 
that the map — (l + i) 5 +lis non-negative and non-decreasing, and use 

(61). The sixth line can be derived by noting that s >-» (1 + s) 2 — 1 is concave 
and thus lies completely under its tangent at s = 0. The last inequality holds 
for e small enough. In the same way one can actually show that as long as the 
spread V(t, 1,1) — ( 1 2 — C) is strictly positive, it is also strictly decreasing. □ 

Lemma B.3. Let C ^ C and to = io(C) the critical point separating S from 
NS. Then 

1. to > 0 and 

2 . to < 00 . 

Proof. (1): Assume to = 0. This means that we actually have 

V{t,b*,b*) = t* - Cb* 

for all t > 0. We then obtain for arbitrary t, h > 0 that 

El*- 1 - 1 )[( 0-^)2 - C( 1 + h)] < (t)i - C7 


by the supermartingale property of the value-process. This is a contradiction to 
Lemma B.l for small enough t and h. 
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(2): Assume t 0 = oo. This means that V(t, 1,1) > — C everywhere. As 

C ^ C we also have V(0,1,1) ^ 0. By Lemma B.2 we can set a := V(l, 1,1) — 
1 + C < C. Now fix some h > and t > (1 + h) 2 . We can then make the 
following estimate, where we use twice the fact that the function 1 1 -» V(t,b,\b\) — 
(t? — C\b\) is decreasing and a h is defined as before: 

V{t, 1,1) = E( t ' 1 > 1 )[V(<7 h ) 1 + h, 1 + h)] 

< V{t, l + h,l + h) + E (t,1,1 ^[((T ?i )5 - ts] 

= (i + h)V (j^2, 1,1) + - ti] 

< i* - (1 + h){C -a)+ E^’ 1,1 ^[((j fc )5 - ti] 

< V(t, 1,1) - A(C - a) + a + - *s] 

Now we can eliminate V(t, 1,1) on both sides and note that h, C and a do 
not depend on t. The last term however goes to 0 for t —* oo by dominated 
convergence (since 0 < \Jt + a h — y/t ^ yfcfi and \Jt + a h — y/t\0). This leads 
to the desired contradiction. □ 
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